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ABSTRACT 

Prediction of book circulation is seen as one major 
factor for some xibraries in the development of an efficient book 
buying policy. The study approaches this problem of predicting book 
use from the conviction that there are certain characteristics 
associated only with high-use books, and certain other 
characteristics vhich are associated with little-used books^ The 
object, then, is to identify these high-use, and no-use or low-use 
indicators so that they may be built into a book selection policy. 
The suggested test indicators included such things as English 
language. Major trade publisher. University press, and Conference 
proceedings. Some of these factors did turn out to be 
high-performance indicators, but for various reasons, among vhich may 
be mentioned the fact that most applied to only a small proportion of 
the collection sampled, only the English language indicator could 
have any appreciable impact on the selection process. (Author) 



1 



ERIC 



FILMED FROM BEST AVAILABLE "COPY 



s£5 

J^^^ OS DEPARTMENT OF HEALTH 

* EDUCATION ft W€LPARE 

JSi^ NATiONALINSTlTUTE OF 

fXj THIS OOCUVENT HAS REFN 1 ILR — 73""03 

OUCEO FXACUV PECEIVEC V 
OO THE PE»S0MO» OttGANi/anON 0»*.^lN 

^ ATINGIT POINTS OF ViE'.V OH OPINIONS 

■ C^-J STATED DO NOT NECES^iARtLV REPHF 

SENT Of^FlCJAU NATiONAi. i SS r i T r C QT 

EDUCATION POSITION OH POL iCV 



Final Report 



Determination of Pre-Acqulsition 
Predictors of Book U&e 



Keri Weeks 



Institute of Library Research 
University of California . 



Berkeley, California ^UT20 



July 1973 



o 
o 

er|c 



r 



ABSTRACT 



Prediction of "book circulation is seen as- one major factor for 
some libraries in the development of an efficient book buying policy. 
The study approaches this problem of predicting book use from the con- 
viction that there are certain characteristics associated only with high- 
use books, and certain other characteristics which are associated with 
little-used books. The object, then, is to identify these high-use, and 
no-use or low-use indicators so that they may be built into a book 
selection policy. The suggested test indicators included such things 
as "English language," "Major trade publisher," "University press/- and 
"Conference proceedings." Some of these factors did turn out to be 
high-performance indicators, but for various reasons, among which may 
be mentioned the f^ct that most applied to only a small proportion of 
the collection sampled, only the "English language" indicator could have 
any appreciable impact on the selection process. 
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I. STATEMENT OF THE PROBLEM AND OBJECTIVES 



The inflationary rise of book prices and the decrease in amount 
, or purchasing power of acquisitions budgets force many libraries to lool: 
at the problem of how to select the books that represent the greatest 
value for that library. 

This notion of measuring book value is itself problematic. Should 
one measure the value of a book by the amount of money patrons are willing 
to pay for it? Or is the amoiint of UGe it gets a better measure of a 
book's value? Then there is the problem of how to measure amount or level 
of use: what, for example, does frequency of check-out really tell about 
book use? And how does one measure in-hcuse use? Moreover, there are = / 
certain collections (e.g,, large academic or research collections) where 
firequency of check-out, or circulation voliame, has little or nothing to 
do with book value. 

Nevertheless, in tills imperfect world compromise is often necessary, 
practical decisions must be made. And if one allows that some notion of 
book value may be obtained from a study of circulation statistics, or that 
the amount of use a book gets (as something to be estimated on the basis 
of its circulatipn history) can be taken as a measure of its value, then 
what is needed if book budgets are to be s^jent" in such a way as to bring, 
about the maximization of benefits to the community served by the collec- 
tion, is some way of predicting circulation before the book is acquired. 

This point is appreciated by the State of California Department of 
Finance report on "Library Cooperation: a Systems Approach to Inter- 
institutional Resource Utilization" [hereafter referred to as the Audi-.^ 
tors' Report ]i which recommends that in a proposed network of libraries 
each member should purchase low-use material only in its area(s) of 
specialization. Clearly, such a notion i presupposes a way of effec- 
tively predicting book use prior to acquisition. 

This, then, was the purpose of this study: to attempt to. develop 
a methodology or set of algorithms for predicting book use in a large 
\iniversity library so that the opportunity to purchase potential high- 
use items mifrht be compromised as little as possible by the expenditure 
of funds for other materials.. > ■ 

The major part of the study was done as part of a Design Seminar , 
conducted in the University of Ca!j.ifornia, Berkeley School of Librarian- 
ship by Professors R. Swank, M. Cooper, and C. Bourne, with later support 
provided by the Institute of Library Research. 

Specifically, two hypotheses wera to be tested: l) that there are 
certain indicators which are particularly characteristic of higu-use 
(=most valuable) and no-use (=least valuable) books, and 2) that these 
indicators could be used, at a stage prior to acquisition, to predict 
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future book use (=book value). 



Before going on to specify what predictors were tested, a brief 
review of previous work in this -^rea may be helpful. 
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II. SUMMARY OP PREVIOUS WORK 



A recent study "by Evans ha^ suggested that books selected by 
librarians tend to circulate more than those selected by fa^julty or 
those received on approval plans, but it fails to identify the methodology 
or criteria by which librarians do their selecting. 

Other studies have approached the problem of predicting book 
circule-tion from the standpoint that "reliable conclusions about book 
usage can be reached by an exaaiinati|on of the characteristics of books ^ 
themselves rather than of how they were acquired or who selected them." 
Fussier and Simon, for instance, were concerned with the identification 
of low-use items in a collection so that, upon their removal to storage, 
additional space is made available for hew acquisitidffs. Their study 
showe^ that a reliable indicator of future use of an item :s its past 
use. . 

Likewise, Trueswell has^^applied, in a whole series of articles, 
the techniques used for managing business inventories — especially the 
80/20 rule — ^to libraries for the ptCrposes of stack-thinning, core 
collection development, miiltiple copy determination, and determination 
of thco optimum size of a library's collection. The key statistic for 
all his strategies is also past use. 

But these approaches to circulation-prediction cannot easily be 
applied to the problem of selection of new books, f or which there is no 
"past use" statistic for that particular library. 

In a slightly different vein, McGrath has pointed up a connection 
between the subject of a book and its level of use: a book tends to 
circulate if its subject matter corresponds to a profile of the college 
or university based on courses of instruction offered there. But the 
effect of such an^ approach to the problem of book selection would seem 
to be limited to libraries serving institutions which emphasize a small 
number of highly specialized academic programs or disciplines-, e.g., a 
mining school, or seminary. 3 

Actually, circulation statistics, such as are available from the 
iioan Department of the Main Library of the University of California, 
Berkeley, can provide an even clearer picture of use-patterns. The Loan 
Department at UCB publishes an annual statistical summary which shows the 
activity of the various parts of the collection, or of the various classes 
of material (corresponding generally to subject areas). ^ But it is not 
enough to know that there is nearly twice the activity among items in the 
B - BJ class (philosophy, logic, metaphysics) as there is in the GN - GV 
class (anthropology ,J:.folklore, games); for although one might, rightly 
or. wrongly, be disposed to spend twice the amount of money for materials 
in the former subject areas, this still leaves the problem of which 
materials to purchase, in either area. 



III. SCOPE OF THE STUDY: POPULATION AND CONSTRAINTS 



Because this study^, in a sense, had been spawned by the Auditors' 
Report, it seemed appropriate to adopt their definitions and constraints 
where possible* Therefore, the universe we sampled consisted, like theirs, 
mostly of monographs and monographic serials; periodicals, theses, phono- 
graph recordings, maps, and art prints were excluded. Although the Audi- 
tors also attempted to exclude obvious gifts, no such attempt was made 
here. Furthermore we used the same algorithm devised by the Auditors 
and used by them in ^heir report to categorize the level of use of each 
item in our own sample (Figure l). 

There were, however, some constraints peculiar to our study. To 
begin, whereas the Auditors drew their sample from the collections of 
six California State University and College libraries, our sample was 
drawn entirely from the Loan Stack Collection of the Main Library of the 
University of California at Berkeley. 

Second, the population we sampled consisted, to be precise, not 
of all monographs and monographic serials, but rather, of those monographs 
and monographic serials which were on the shelves at the time of the^ data 
collection. (For more on this, see the section on SAMPLING TECHNIQUE.) 
Put briefly, bur sample was therefore biased against the set- of books not 
on the shelves at that time — which most likely means biased against high- 
use items. Probability of bias is estimated at a maximum of 8.6^, cor- 
responding to the largest percent of the collection in circulation at any 
one time [i.e., (estimated total ntunber of volumes in circulation) divided 
by (total number of volumes in collection)].* 

Third, while we did use the Auditors? algorithm to categorize 
level of use, nevertheless this categorization was made only on the evi- 
dence of extant transaction records. 'The circulation history of each 
book is recorded on d^te-due slips pasted on the end paper. When- a date- 
due slip is removed^ either by accident or design, the complete record 
does not survive. In fact, this happens often, though no statistics 
were kept on the number of occurrences within our sample. The point, 
however, is that when in our study part or all of the record might have 
perished, the algorithm for categorizing the level of book use* was strictly 
applied to the surviving data only, if any; no estimates were, attempted, though 



* 90,100/1,0^9,^7- I am indebted to Mr. Tim DeWolf of the Loan Depart- 
ment, Main Library, UCB, for this figure, which is based on total holdings 
of all Loan Department books, including those in Richmond storage. It 
does not include any items not recorded in thejloan files, such as de- 
partment charges and books waiting to be reshelved. 
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it might have seemed reasonable to automatically rank as high-use an item 
from which, say, two date-due slips had been removed. On page 25 of their 
Report, the Auditors acknowledge this problem of the risk of measurement 
error. They suggest, however, on the basis of a measurement of actual 
error that they were able to make, that in fact it amounted to only a 
small risk for their sample data. For us who vere not able to obtain a 
measurement of actual error, the risk amounts at worst to the fact that 
we may have applied, to some extent, a yet more conservative measure of 
book use; i.e., the number of low-use items grows even larger, while the 
number of high-use items decreases. This is a significant point because 
the result is a further refinement of the high-use group and perhaps yields 
a purer set of high-use predictors. 

Fourth, the "nc-use" category doubtless in'cludes a certain percentage 
of books only recently cataloged and, therefore, available to patrons for 
only a short time; even if one supposes that use equals value, it is a 
bit rigid to say that all "no-use" items are the books of least value. 
Actually, this must be true of the data collected by the Auditors them- 
selves, but they do not make a point of it .in their Report. The- presence 
of this subset of "no-use/potential high-use (or no-use)" items could compli 
cate matters by introducing what might be uncharacteristic predictors into 
the "no-use" bank. No statistics were kept on the number of sample items 
that fell in this category, but of the total number of monographs in the 
collection as of June 30 ^ 1972, approximately 2.3^ had been added within 
the fiscal year ending on that date. * 

Fifth, we did not consider interlibrary loan transactions, although 
they are so indicated on the date-due slips, in computing level of book 
use. Our reasoning was. that level of book use was proper-ly defined by 
local use.. No record was kept of the nurter of ILL transactions we so 
ignored. 

Finally, unlike the Auditors, we did not correct for unrecorded or 
in-house usage. The effect of this, to judge from the Auditors' analysis 
on pagea 36-UO of their Report, was a failure to correctly identify a 
certain percentage of "'true* high-use" and "'true' low-use" items, 
having misplaced them in a lower use category. 




* This percentage is based on figures for total "bound volumes" in the 
"main i.e., excluding Bancroft, Morrison, 

and all branch libraries. From the. "Size of the Libraries of the Uni- 
versity of California. Table II: Bound olumes and Current Serials 
(Berkeley Details), 27:35 (21 September 1972), p. 10. 
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Figure 1 

The Auditors' Book Classification Algorithm 
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IV. TEST PREDICTORS 



In order to qualify for testing, a "predictor" or "indicator" 
had to satisfy the basic requirement that it be among those of which 
a bibliographer or selections officer might reasonaoly be expected to 
have some knowledge from his reading of pre-publication notices and ad- 
vertisements or such other usual acquisition tools as library or publishers' 
catalogs. In addition, hunch, intuition, and examination of a preliminary 
sample of 50 items played a part in this selection process. This method 
eventually yielded the following list of original test indicators: 

1. English language: In the case of bi- or multi-lingual books, 
any item of which a substantial part of the text was in English 
was regarded as an English-language book. 

2. L ingle author — pesonal: Anonymous or pseudonymous works were 
not included. 

3. Multiple author — personal: Edited works were included, but 
translators were not regarded as contributing to multiplicity 
of authorship* 

h. Major trade publisher: Originally this was tentatively and 
arbitrarily taken to include only the following American pub- 
lishers: McGraw-Hill, Wiley, Prentice-Hall. Crowell/Collier/ 
MacMillan,. College & University Press, Norton, Van Nostrand, 
and Johnson Reprint Company. Later the list was expanded to 
include the following: Viking, Button, Harper, Putnam, Sheed 
& Ward, Dodd Mead, Lippincott, Holt, Scribner's, Knopf, Houghton 

\ Mifflin, and Appleton. For further details see the section 

1 on TEST RESULTS. 

5- University press: The sample was not restricted to American 
' university presses. 

6. Bibliography, catalog, abstract, annual review, and conference 
proceedings. "Conference proceedings" was later added as a 
\ separate indicator. For further details see the section on 
iTEST RESULTS, part 

i 7- Handbook, manual g'lide. 

i8. Illustrated: Photographs, drawings, etc.; these were to be 
distinguished from the types of graphic materials listed under 
iMicator 9. 

9. Maps, charts, diagrams, tables. 

10. Work complete in this voltune. 
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"Price" was not tested because this information was not available 
for the test either fr le public shelf lidt or from the book itself. 
Nor was "imprint date" ~^ed, although a significant percentage of books 
selected for academic libraries are non-current items. For example, 
according to statistics for the libraries of the nine University of Call-- 
fornia campuses, 31% of all monographic materials cataloged during the 
period 1963 to 196? bore a publication date of 19^9 or earlier. ^ It 
was nevertheless felt that these non-current items come especially re- 
commended by faculty or staff, in which case some of the uncertainty 
about future level of use is already settled. 
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V. SAMPLING TECHNIQUE AND DATA COLLECTION 



A* SAMPLE I 

To begin, a sample of 50 items was taken from the public shelf 
list. The shelf list was used as the data basis because, unlike the author/ 
title and subject catalogs, l) each item is entered only once and all items 
thus have an equal chance of being selected, and 2) the shell' list contains 
only items found in the Loan Stack Collection. 

Using a random number table, 50 drawers were chosen at random, and 
a card one^inch from the front of each draver was noted. Wnen 50 cards 
had been so selected, the books were retrieved from the shelves. Because 
of time constraints, no attempts were ms.de to recall or .search any items 
not on the shelves; instead, additional,, items were selected from the shelf 
list and retrieved. If a book was not on the shelf, another card was selected 
from the same drawer, this time one-inch from the back. If this second 
selection was not on the shelf, a third card was selected from the same 
drawer, this time two inches from the front, and so on, ' Eventually, 50 
sample items were so collected and classified according to level of use 
(high, low, or no), using the Auditors* algorithm. The ^'high-use" and 
"no-use" groups were then examined for promising predictors of book use. 
The data breakdown is given in the section on TEST RESULTS; it shows 
that four predictors appeared to be promising at this stage. An indicator 
was regarded as promising if the ratios, based on normalized scores, of 
high-use to no-use, and no-use to high-use were markedly different. The 
promising predictors, expressed in high-use terms were the following: 
"English language," "Major trade publisher ,-^I_"University press," and "Bib--- 
liography, catalog, abstract, anniial review, conference proceedings." 

B, SAMPLE II 

A second sample was then obtained used a less time-consuming method* 
In the first sample no significant effort was made to avoid prejudicing 
the data collection against books not on the shelves. This was taken one 
step further in the second sample: the shelf-list was by-passed altogether, ' 
and the following rule was used in selecting this next batch of sample 
items: For each stack. level, starting in the first aisle at the north- 
east end of each row of stacks, and afterwards every other aisle, select 
the 10th book on the kth shelf of the 2nd section facing east. This re- 
sulted in the collection of 221 items. A log was kept which recorded 
each item's call number, use-classification, and the presence or absence 
of the four predictors identified for fiarther testing by the first sample. 
This sample yielded results which led to testing of additional predictors 
in the subsequent sample. 

C. SAMPLE III 

Finally, a third sample was drawn according to a rule similar to . 
that used for collecting the second batch: for each stack level, starting 
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in the 2nd aisle at the northeast end of each row of stacks, and after- 
wards every other aisle, select the 5th book on the 3rd shelf of the 3rd 
section looking west. This yielded 192 sample items, and brought the 
total sample to kgs items. A log was kept to record the presence or 
absence of the original ten iiidicators of the first sample, plus one 
additional indicator. (For details see the following section on TEST 
RESULTS.) 
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VI* TEST RESULTS 



A. SAMPLE I 

The first sample (n=50) turned up 21 high-use items, 10 no-use 
items, and 19 low-use items. The test results are summarized in Table 
1. Because the frequencies were extremely disparate, particularly be- 
tween the high-use and no-use groups, the scores for the no-use group 
were normalized; that is, they were scaled upward as if there were as 
many no-use books as high-use boolfs in the sample. This facilitated 
meaningful comparison between the two groups. 

These points of comparisons reflect the fact that it was upon the 
two levels high-use and no-use that our interest centered. "Low-use" 
was expected to be a slush category which would supposedly contain a 
mixture of predictors of high-use and predictors of no-use. 

Of the. 21 high-use items, l6 (76%) were English-language materials; 
and of the 10 no-use items, 9 (90^) were non-English language. The most 
important figures were those concerned with a given indicator's comparison of 
the ratio of its high-use to no-use with the ratio of its no-use to high 
use. For the English language items, the ratio of high-use to no-use was 
nearly 8:1, while the ratio of np-use to high-use was 13:10Q. (Both 
ratios were based on the normalized no-use figure.) Likewise, for non- 
English language items the ratio of high-use to no-use was 26:100 (or 
0.26), while the ratio of no-use to high-use was 3.78. Prom this 
disparity between the two ratios, it appeared that "English language" 
might be a predictor of high book use and that "non-English language" 
might predict no-use. This early indication was confirmed throughout our 
sampling. 

In contrast, the other indicators in the first sample appeared to 
have little or no predictive value. In only two other cases — "Single 
author — personal" and "Work complete in this volume^^~did a predictor 
show up, in significant numbers; but the instances of its presence 
were evenly divided among the high-use and no-use categories. (See 
column of normalized scores. Table 1.) More often the case was one of 
low incidence of a predictor's presence, with the instances of its absence 
evenly divided between' the high-use and no-use levels. 

Within this negative context there were three indicators ("Major 
trade publisher," "University press," and. "Bibliography, etc.") where 
the difference between the ratio of high-use to no-use.'and the ratio of 
no-use to high-use seemed large enough to support some prospect of pre- 
dictive value. 

At this stage of data collection, the "Major trade publisher" 
included the following: McGraw-Hill, Wiley, Prentice-Hall, Crowell/ 
Collier/MacMillan^ College & University Press, Norton, Van Nostrand, , 
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and Johnson Reprint Company. This indicator showed up only 5 times 
among the 31 high-use and no-use items; of those 5 instances, h were 
high-use, A similar pattern of meager returns yet promising ratios was 
repeated for "University Press" {k high-Use :1 no-use) and for "Biblio- 
graphy, etc," (2 no-use:0 high-use). We next tested these h indicators 
in the second sample of 221 itemF., 

B, SAMPLE II 

The second sample yielded. 63 high-use and 87 no-use items, ' The 
striking feature of this sample was that "English language" continued to 
show as a strong high-use indicator. Table 2 contains the breakdown of 
these 221 sample items by indicator and use-category. Concerning the 
English language items, the ratio of high-use to no-use is nearly U:!, 
while the ratio of no-use to high-use is about 1:U, (These ratios were 
based on a nonr.alized score for the, high-use group, ) 

"Ijon-English language" was also an interesting indicator. Half of 
the non-English language items were no-use; about 32^ were low-use, and 
only lQ% were high-use. The implication is that If non-English language 
were used as a basis for rejection , only l8 out of 100 books so rejected 
would have represented eventual high-use items lost to the collection. 

Results for the other indicators tested in this sample were dis- 
appointing insofar as there were only negligible differences between 
the high-use: no-use and the no-use : high-use ratios, i \ 

Two additional points deserve comment, l) Clearly, _J:Jtie first sample 
of 50 was too small to reliably indicate which "predictors" might work. 
2) The low incidence of some indicators pointed out that even if such an 
indicator as "Major trade publisher" were found by itself to be a high- 
performance indicator, nevertheless it would be of limi'ted value to the 
bibliographer if books from major trade publishers constituted only a 
small percentage of the annual acquisitions. 1 

C, SAMPLE III 

We then tested a third sample of 192 items for a frequency distri- 
bution of the original ten indicators, plus one other indicator; the indi- 
cator "Bibliography, catalog, abstract, annual review, and conference pro- 
ceedings" was divided into two separate indicators: "Bibliography, cata- 
log, abstract, annual review'| a^id "Conference proceedings," We also ex- 
panded the number of "Major trade publishers" to include the following: 
Viking, Button, Haxper, Putnam, Sheed & Ward, Dodd Mead, Lippincott, Holt, 
Scribner's, Knopf, Houghton Mifflin, and Appleton. 

The third sample yielded 62 high-use items, 69 no-use items, and 6l 
low-use items. The results are given in Table 3 and are consistent with 
our earlier findings. Note, for example, that "English language" con-, 
tinues to indicate high-use, while "non-English language" is only slightly 
less certain as a predictor of no-use. Also, "Major trade publisher,'* 



13 



CVJ 
C\J 

•II 





CQ 






CO 




lA ON 


OO vo 




re 


hi 


• 


On 


O 
• 


ON ON 
• • 


^ ON 


o 


CO 


d 




H 


H 




H 




CQ 














Q) 
















CQ 
















(A 


(U 














IQ 


N] 


o 


CVJ 


O 






tn 




•H 




oo 


tA 


O 


o o 


t-r O 


CQ 


H 










• • 




O 


i 




rn 




H 


H H 


' H 


•H 


















u 


•H 














o 































CVJ 



CVJ 



Eh. 



ERIC 



H 

I 

CQ 

U 

O 

CQ 

o 

•H 



•p 

CQ 
0) 
Eh 

O 

O 
•H 

I 

•H 

•P 
CQ 
•H 

P 

J>» 
O 

a 

& 

p«4 







CQ 




o 


m 






7^ 
\ 


H 


CO 


CO 






O 
































0) 














CQ 


0) 


m 


^ CO 


CVJ 


•H 




CO 








H 


^ 


?i 


ON 


H vd 






O 


1 




CO 


CO 




O 












CQ 














•H 





















CQ 
•H 

•p 

•P 



0) 

CQ 

j:l 

<^ 
o 



a 

& 



o 



0) 

CO 

t 

o 



0) 
CQ 

J 

O 



0) 
CQ 
7^ 
I 

bO 

•rl 



CQ 
O 

t 

O 
•H 



CVJ ^ 



on CVJ 



■ bO 
W H 

bO CQ 

10} H 
H bO 



CQ I 

H O 
bO « 



CVJ ON 
VO 



o t-- 
co 



vo 



on 

CO 



I CVJ mo 
vo vo 



oo ON 



vo vo 



tA CVJ 
CO 



tn VO 
vo 



CO C3N 



a 

•8 











o • 










•p 












CO 
•H 










H 




CQ 










CO 




. §• 


t 




<U 














. fe 


0) 














>> 




ft H 




•P 




0} P 






•H 




^ 'H 






CO 




bO P 




^ 


1 


O 1 


U 








•H C3 


0 


0 


> 


0 


H 0 






•H 














•H 


1 








pq 



understood here in its expanded sense, looks very good as a predictor 
of high-use. However, "non-major trade publisher" was not a predictor 
of no-use. None of the other frequencies is remarkable'. 

D. COMBINED SAMPLE | 

The combined statistics for the three samples are simmiarized in 
T 61e h. Important to note is that the frequencies for the predictor 
"Major trade publisher" are not homogeneous;' because that predictor was 
defined differently for the third sample. Mso, separate statistics for 
"Conference proceedings" were collected only for the third sample. 
Finally, Table h also includes estimated percentages of the Loan Stack 
Collection represented by each of the predictors. These estimated per- 
centages were based on a random sample, where n=300, drawn from the public 
shelf list with the help of a random number table. 

E. COMBINATIONS OF INDICATORS 

The performance of our test indicators can be improved by taking 
the indicators in certain combinations, "able 5 displays some interesting 
combinations. Table 5 also shows that while combining predictors will 
improve their level of performance 5 combining them also reduces their 
impact. For example, "English language" taken with "Major trade pub- 
lisher" yields a high-use to no-use ratio of 17:1, but represents only 
6% of the collect ion. 
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VII- CONCLUSIONS 



Oxir data collection and analysis showed that there was, indeed, a 
correlation between "English language" and high-use, and "between "non- 
English language" and no-use. However, even if one supposes that English- 
language materials represent a good investment of book funds, there still 
remains the problem of deciding which English language items to acquire, 
as no library book budgets are sufficiently large to cover the purchase of 
all English-language items. 

One must also consider that none of these predictors is infallible. 
This is suggested. by the case of "English language," a moderately high- 
performance predictor that represents a la"^ge part of the collection. If 
one were to purchase all 178 English-language items included in our combined 
sample, the result would be the acquisition of 28 new items (about l6%) 
Which might never be used, or 8k items {kj%) which may never be high use. 

We also note a serious r athodological flaw. 9^ o± 178 English 
language items in our combined sample turned out to be high-use. This 
does not give reason to suppose that the total population of all English- 
language books would yield the ssune percentage of high-use items. For 
our sample does not represent that universe of all English-language books, 
but rather, a universe (large university research collection) where pre- 
selection has already taken place, although perhaps not on the basis of 
anticipated use. This point is equally applicable to all our test indi- 
cators and further reduces by an unknown factor whatever value they might 
be thought to possess in the book-selection. 

In summary, we found fo\ir potentially good indicators of high-use. 
In decreasing order of performance level, they were as follows: "Major 
trade publisher," "English language," "University press,'' and ''Muljsiple 
author — personal." Two good indicators of no-use from the many tested 
indicators were "non-English language" and "Bibliography, catalog, ab- 
stract, annual review, conference proceedings." However, we do not know 
how effective these vindicators would be for any material beyond that 
which has been pre-selected by the library selections staff. Further- 
more, if four of these six potentially good indicators were applied 
(^'English language" and "non-English language" excepted), they would 
correspond to only a small fraction (6 to 13%) of the collection as it 
is presently constituted. Unfortunately, there is no means of predicting 
the percentage of annual acquisitions (a more meaningful category in the 
context of this report) to which these indicators correspond. 
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